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ABSTRACT 

Recent longitudinal studies of age-dependent 
leukocyte telomere length (LTL) attrition have 
reported that variable proportions of individuals ex- 
perience LTL lengthening. Often, LTL lengthening 
has been taken at face value, and authors have 
speculated about the biological causation of this 
finding. Based on empirical data and theoretical 
considerations, we show that regardless of the 
method used to measure telomere length 
(Southern blot or quantitative polymerase chain 
reaction-based methods), measurement error of 
telomere length and duration of follow-up explain 
almost entirely the absence of age-dependent LTL 
attrition in longitudinal studies. We find that LTL 
lengthening is far less frequent in studies with long 
follow-up periods and those that used a high-preci- 
sion Southern blot method (as compared with quan- 
titative polymerase chain reaction determination, 
which is associated with larger laboratory error). 
We conclude that the LTL lengthening observed in 
longitudinal studies is predominantly, if not entirely, 
an artifact of measurement error, which is 
exacerbated by short follow-up periods. We offer 
specific suggestions for design of longitudinal 
studies of LTL attrition to diminish this artifact. 

INTRODUCTION 

Telomere length (TL) on average declines with age. 
However, longitudinal studies in humans report varying 
proportions of subjects who ostensibly display TL 
lengthening over time (1-12). In this communication, we 
address the following question: does the lengthening of TL 



represent a true biological phenomenon or an artifact 
inherent in the measurement error of TL in relation to 
the duration of the follow-up period? Biological 'noise 1 , 
arising from neither genes nor the environment, is a major 
explanation for the inter-individual variation in pheno- 
typic expressions (13). This form of noise is largely 
beyond our control. However, the impact of noise due 
to measurement error on research findings can be 
minimized both by optimizing assays and improving 
study design. 

As most epidemiological telomere research has focused 
on leukocyte TL (LTL) dynamics (LTL and its age-de- 
pendent attrition), we examined this key question with 
respect to LTL. If age-dependent LTL attrition, which 
ultimately reflects TL shortening in hematopoietic stem 
cells (14), is indeed halted or reversed at any time of the 
human life course, major efforts should be invested to 
explain the etiology of this phenomenon. Reversal of 
LTL attrition could originate from altered telomere 
dynamics at any level of the hematopoietic hierarchy, 
including hematopoietic stem cells at the top and extend- 
ing down through hematopoietic progenitor cells to 
circulating leukocytes at the bottom. However, if LTL 
elongation is an artifact, researchers need to optimize lon- 
gitudinal studies of LTL dynamics (by using both meas- 
urement methods that minimize error and performing 
studies with extended follow-up) so that the overall 
effect of this artifact will be minimized or eliminated. 

To this end, based on empirical data available from 
published longitudinal studies, the duration of which has 
ranged from 6 months to 13.1 years (Table 1), we per- 
formed a series of analyses with a view to further 
explore the relation between change in LTL over time, 
LTL measurement error and the length of the follow-up. 
In discussing our results, we seek potential explanations 
for discrepancies between the published empirical findings 
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Table 1. Empirical and computational data related to longitudinal studies of LTL attrition 
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Year = year of publication; N = sample size; FU = follow-up duration; LTL is expressed in bp for the SB, T/S units for the qPCR method or both, 

if the transformed T/S units are also expressed in absolute LTL; LTLb = LTL at baseline; LTLfu = LTL at follow-up; CV = inter-assay coefficient 

of variation; Rep = inter-assay replicate number (1 is used when this is not provided); Observed G = observed percentage of LTL gainers with 95% 

confidence interval (in brackets); Predicted G = the predicted percentage of LTL gainers during the follow-up. 

"Various subsets of the Bogalusa Heart Study, described in different publications. 

b Same study, but LTL change was measured at different follow-up intervals. 

^Overall, no change in LTL during follow-up, but percentage of LTL not gainers reported. 

d T/S data converted to absolute LTL units (bp). 

e Study provided the value of 'stable or increased' LTL jointly. Half of the individuals with 'stable' LTL could not be computed, based on the 
principle applied to the other studies. Thus, this value might be an overestimate of LTL gainers. 

Baseline and follow-up LTL and T/S was reported to be identical (up to the precision used in the article), thus the change cannot be computed. 
g Study consisted of children aged 5-10 years old. 

h Only the value of LTL gainers (>15% lengthening) was reported; hence, this value might be an underestimate of LTL gainers. 
Blank entries denote parameters not reflect unreported data or not computed owing to insufficient information. 

The percentage of LTL gainers could not be predicted in several studies for the following specific reasons: for (6), the mean change between baseline 
and follow-up LTL was zero within the precision used to report values; (11) reported intra-assay CV for T and S separately, but not inter-assay for 
T/S ratio. For (4), the unusually low 0.9% CV is assumed to be the inter-assay CV. For (3) and (5), no data were provided about LTLb and LTLfu 
to compute Change. 



and the outcomes of these computations. We further 
propose a way forward with respect to the measurement 
of LTL and study design in evaluating human telomere 
dynamics. 

MATERIALS AND METHODS 

Measurement error can lead to observations that LTL 
spuriously seems to become longer over time in a subset 
of individuals even under an assumption that LTL 
undergoes age-dependent attrition in all individuals. This 
can be quantified by denoting n as the true LTL attrition 
over a fixed period and assuming that it is the same for all 
individuals. This assumption does not typically hold in 
epidemiological and biological settings, but we apply it 
here to enable use of the limited information available 
from published papers. A random effects model, which 
includes not only the mean LTL attrition but also a 
term that can capture individual deviations from the 
mean attrition, would give a more precise estimate. 
However, such a model would require the entire data set 
of a given study. 

Because of measurement error, observations of an indi- 
vidual's change in LTL will deviate fromii. We assume 
that the measurement error of LTL is normally distributed 
around the true value of LTL. We denote the standard 
deviation (SD) by a and assume that a is the same for all 



individuals. As measurement error occurs for baseline and 
follow-up samples, the a of the difference between 
baseline and follow-up (cr d ) will be larger than a. From 
the properties of the normal distribution, it follows that 
cr d = crV2- This increase in measurement error can be 
compensated for by performing replicates of baseline 
and follow-up. If, for instance, duplicate measurements 
of both baseline and follow-up are performed (i.e. a 
total of four measurements for each individual), then the 
mean of the two differences will give a more precise 
estimate of the true difference than each of the differences 
on their own. In this case, a d = a. Further increasing the 
number of replicates will increase precision and thus 
decrease a d . 

The two tails of the normal distribution extend to 
infinity. Thus, no matter how large li is and how small a 
is, there will always be a chance that the measured change 
is positive. That said, a larger |i and a smaller a will result 
in fewer individuals who appear to show a longer LTL at 
follow-up (LTL gainers). This is illustrated for different 
normal distributions in Figure 1 . For instance, one distri- 
bution assumes that the true loss n is 300 bp (which could 
correspond to 10 years of follow-up at a loss of 30 bp/ 
year), and that the a d is 1 50 bp (which could correspond 
to ct of 150 bp with two replicates). The red shaded areas 
correspond to the individuals who (purely because of 
measurement error) are observed to have a longer LTL 
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Figure 1. Theoretical curves showing the distribution of measured change in LTL because of measurement error. Red shaded areas denote the 
percentage of individuals who will be misclassified as LTL gainers. Left panels assume a loss of 300 bp and right panels assume a loss of 200 bp 
during follow-up periods, which could correspond to 10 years of follow-up with an attrition of either 30 or 20 bp/year, respectively. Top panels 
assume an SD of the measured difference of 150 bp, and bottom panels assume an SD of the measured difference of 212 bp, which could correspond 
to an SD of the measurement error of 150 bp for two independent measurements or just a single measurement, respectively. As we consider the 
measured change in LTL to be positive for LTL gainers, u (200 or 300 bp) is positioned to the left of 0 on the x-axis. 



at follow-up than at baseline, i.e. LTL gainers. The prob- 
ability of such a misclassification in this case is 2.28%, 
which can be found using computer software or 
standard tables for the normal distribution. 

Unfortunately, the a of the measurement error is typ- 
ically not reported in publications on LTL. Instead, the 
unitless CV of LTL (i.e. the SD divided by the mean of 
LTL) is reported. Therefore, when converting the CV 
(inter-assay or intra-assay) to a to construct the theoret- 
ical curves depicting the relation between the percentage 
of LTL gainers and follow-up duration (Figure 3), we 
assumed a mean LTL of 7 kb (which is equivalent to the 
average LTL in young adults). 

Notably, Epel et al. (6) and Farzaneh-Far et al. (7) 
reported subsets of participants in their studies as 'main- 
tainers' of LTL, i.e. unchanged LTL, based on a difference 
of follow-up LTL from baseline LTL of less than ±15% 
and ± 10%, respectively. Based on these definitions, 46% 
in Epefs study (6) and 32% in Farzaneh-Far's study (7) 
displayed unchanged LTL during the follow-up period. 
For estimating the actual percentage of LTL gainers, we 
assumed that half of these individuals, i.e. 23% in Epel's 
study (6) and 16% in Frazaneh-Far's study (7), were LTL 



gainers. This approach was applied to all studies listed in 
Table 1 , even when the number of individuals reported as 
having unchanged LTL was small. Nordfjall et al. (5) 
reported the percent of individuals with 'stable or 
increased' LTL; thus, the observed 34% of LTL gainers 
(Table 1 and Figure 2) might be an overestimate, based on 
the principle applied to the other studies. In contrast, 
Shalev et al. (11) reported only individuals showing 
>15% LTL lengthening as LTL gainers; thus, the 
observed 16.9% of LTL gainers (Table 1 and Figure 2) 
might be an underestimate, based on the same principle. 
In addition, we could not compute the predicted percent- 
age of LTL gainers in a number of studies because of 
insufficient information, such as absent baseline LTL or 
follow-up LTL and lack of clarity about the CV. Specific 
reasons are also provided in the legend of Table 1 . 

RESULTS 

Based on empirical data derived from the longitudinal 
studies presented in Table 1, the proportion of LTL 
gainers was inversely related to the duration of the 
follow-up, i.e. the shorter the duration, the greater the 
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Figure 2. Percentage of LTL gainers versus follow-up time based on studies presented in Table 1. The number in each symbol denotes the reference 
of the study. *A subset of the same study; "this percentage of LTL gainers is an underestimate, as it does not include half the subgroup with 
'unchanged LTL' (which was not provided by the authors); ***this percentage of LTL gainers is an overestimate, as it includes the entire 'unchanged 
LTL' subgroup. 



proportion of observed LTL gainers (Figure 2). This was 
evident both for Southern blot (SB) analysis and quanti- 
tative polymerase chain reaction (qPCR)-based methods 
of LTL measurement. However, smaller proportions of 
LTL gainers were consistently observed for the SB 
analysis (Figure 2). It could be argued that although in 
the long run LTL undergoes age-dependent shortening in 
all individuals, it experiences true fluctuations up (being 
lengthened) and down (being shortened) during short time 
intervals, as previously suggested (9). To address this pos- 
sibility, we performed computations of the impact of the 
measurement error on the outcome of longitudinal studies 
of LTL attrition. 

Figure 3 was constructed based on the stipulation that 

(i) LTL = 7 kb at baseline, (ii) average rates of age-de- 
pendent LTL attrition of 30 or 20 bp/year and (hi) single 
or duplicate measurements of baseline and of follow-up 
LTLs. The figure showcases the computed percentage of 
individuals misclassified as LTL gainers for different 
measurement errors and follow-up periods. For instance, 
with a CV of 3%, ~8% of individuals would be mis- 
classified as LTL gainers when (i) duplicate measurements 
are performed at baseline and follow-up examinations, 

(ii) the rate of LTL attrition is 30 bp/year and (hi) the 
follow-up duration is 10 years. However, ~20% of indi- 
viduals would be misclassified as LTL gainers under the 
same circumstances when the CV is 5%. (Figure 3, left 
upper panel). 

To further illustrate the impact of the measurement 
error of LTL on findings and conclusions, we focused 
on studies that provided sufficient information (i.e. the 



CV of the LTL measurement, the follow-up duration 
and the rate of LTL attrition based on baseline and 
follow-up LTL measurements) to compute the predicted 
percentage of LTL gainers (Table 1). The predicted pro- 
portion of LTL gainers ranged from 0.0 (4) to 50.0% (9), 
and the observed proportions of LTL gainers ranged from 
3.0 (10) to 50% (9). In one study, the predicted and 
observed proportions of LTL gainers were the same, i.e. 
50% (9). In another sub-study, the predicted percentage of 
LTL gainers was minimally larger (14.8%) than the 
observed value (14.4%) (8). In all other studies or sub- 
studies, the predicted proportions of LTL gainers were 
smaller than the observed values. However, in three of 
these studies, the predicted values (9.6, 1.0 and 1.8%) min- 
imally differed from the observed ones (10.7, 1.5 and 
3.0%, respectively). Thus, in five studies, the predicted 
percentages of those misclassified as LTL gainers 
matched well with the observed LTL gainers. In five 
other studies, the predicted percentages of LTL gainers 
(0.1, 1.0, 0.0, 1.7 and 22.8%), differed substantially from 
the, respectively, observed values (7.9, 11.2, 15.9, 7.5 and 
39%). 



DISCUSSION 

It is baffling that some authors did not consider TL meas- 
urement error as the most parsimonious explanation for 
their finding of LTL lengthening in longitudinal studies. 
Instead they concluded that LTL frequently elongates 
with age and sought a biological meaning for this phe- 
nomenon. Gardner et al. (1) were the first to report LTL 
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Figure 3. Theoretical curves showing the corresponding percentage of LTL gainers for a given follow-up period for different inter-assay CVs. Upper 
panels are based on two independent LTL measurements at baseline and two independent measurements at follow-up, and lower panels on one 
measurement at baseline and follow-up. Left panels display the curves for an average rate of LTL attrition of 30 bp/year. Right panels display the 
curves for an average rate of LTL attrition of 20 bp/year. Keys for trajectories of different CVs are shown in the insets. 



lengthening in a small longitudinal study and postulated 
biological causes for this enigmatic finding. However, a 
recent report concluded that LTL lengthening observed 
in longitudinal studies is essentially an artifact, which is 
largely because of laboratory error in TL measurements in 
relation to the follow-up duration (8). Previously, the 
underlying reasons for this discrepancy were examined 
through empirical evaluation (8) and here through com- 
putations based on published empirical data and theoret- 
ical considerations. 

The tenet that age-dependent LTL attrition might be 
halted during the human life course was originally 
proposed based on cross-sectional studies, which appar- 
ently showed that LTL attrition levels off during early 
adulthood (15,16). However, this claim was not persua- 
sive, as has been made clear by statistical considerations 
(17). With necessarily guarded inference because of the 
limitations of the cross-sectional design, statistical consid- 
erations indicated that because of the wide inter-individual 
variation in LTL at a given age, large sample sizes and 
wide age ranges of the subjects are required in cross- 
sectional studies for a reliable assessment of age-depend- 
ent LTL attrition. Although the rate of LTL shortening is 
more rapid during early life than during adult life 



(15,18,19), based on numerous cross-sectional studies, 
including a large study recently published (19), there is 
no evidence for a pause in LTL shortening during early 
adulthood. 

As we show here, longitudinal evaluations are also 
subject to factors that strongly affect estimates of age- 
dependent LTL attrition, namely, laboratory error, 
number of replicates, the length of follow-up and the 
rate of age-dependent LTL attrition. The duration of the 
follow-up period in relation to the CV of TL measure- 
ments is critical as demonstrated in Figures 1-3, given 
that the magnitude of change in LTL is usually but a 
fraction of the absolute value of LTL (20). 

Three major factors might account for the lower pre- 
dicted estimate of individuals misclassified as LTL gainers. 
First, investigators might underestimate their measure- 
ment error of LTL. The study by Farzaneh-Far et al. 
(Table 1) (7), which showed the largest gap between the 
observed (39% LTL gainers) versus the predicted (22.8% 
of individuals misclassified as such), could serve as an 
illustration. The predicted value of LTL gainers in that 
study was based on its reported inter-assay CV of 3.7% 
for LTL measurements by qPCR, but a recent impartial 
evaluation of the inter-assay CV of LTL in the laboratory 
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performing these measurements was 6.4% (21) (the SB 
method was also evaluated impartially in that study, 
but, as shown in Table 1, its inter-assay CV of 1.7% 
was similar to values reported in longitudinal studies 
using this method). Computation based on a CV of 
6.4% for the study by Farzaneh-Far et al. (7) yielded a 
predicted value of 33.4% of individuals being misclassified 
as LTL gainers, which is much closer to the observed 
value of LTL gainers reported by the authors. Notably, 
the effect of measurement error can be reduced in longi- 
tudinal studies in which the individual's baseline and 
follow-up samples are assayed in the same 'batch' 
(qPCR) or 'gel' (SB). For further details, see 
Supplementary Supplement SI. 

Second, the predicted percentage of LTL gainers is 
based on the assumption that LTL attrition is the same 
for all individuals, which, as noted under the 'Materials 
and Methods' section, is unlikely to be the case. A more 
realistic model to predict the percentage of LTL gainers 
would be a random effects model that accounts for 
between-individual variation in LTL longitudinal 
change. However, published papers rarely provide the 
entire data set that is necessary to apply the random 
effects model, which typically generates a higher predicted 
percentage of LTL gainers. The recently published study 
(12), whose entire data were available to us, serves to il- 
lustrate this concept. Using the model we use here, i.e. 
LTL attrition is the same in all individuals, the predicted 
percentage of LTL gainers was 1.7 (Table 1). However, 
using the random effects model, which includes the mean 
LTL and the individual's specific attrition, the predicted 
percentage of LTL gainers was 10.6. This was much closer 
to the observed 7.5% (95% confidence interval 2.8-15.6) 
of LTL gainers reported in that study (12). For further 
details see Supplementary Supplement S2. 

Third, although laboratory measurement error is a 
major cause, by no means is it the only factor that 
accounts for the misclassification of subjects. The circum- 
stances related to the baseline and follow-up samples are 
also important. In longitudinal studies, blood samples are 
often collected and DNA extracted by different investiga- 
tors and different techniques that might impact on DNA 
integrity and purity. Although the state of DNA integrity 
might affect LTL results generated by SB to a greater 
extent than those generated by qPCR, protein and 
organic solvent impurities in the DNA samples might 
exert a greater impact on the qPCR results than those of 
the SB. Thus, meticulous attention to these potential 
problems should be exercised in longitudinal studies. 

In summary, one cannot totally exclude unusual cir- 
cumstances where an individual might display lengthening 
of LTL because of biological causes. However, as our 
analysis shows, under most circumstances, findings of 
age-dependent LTL lengthening seem to be the outcome 
of a less than optimal study design and LTL measurement 
error that is disproportionally large in relation to the 
extent of LTL attrition during the follow-up period. 
Teasing apart the two factors that fashion LTL during 
the human life course, i.e. LTL at birth and its age- 
dependent shortening afterward, will provide a valuable 
and long-awaited insight into human telomere biology and 



its role in human aging. But this cannot be accomplished 
through longitudinal studies in which the noise of TL de- 
termination because of measurement error exceeds age- 
dependent change in LTL during inadequate durations 
of follow-up. The graphs we present, and the specific il- 
lustrations we provide might serve as useful guidelines for 
researchers in estimating the duration of the follow-up 
period required in relation to the CV of TL measurement. 
These graphs also showcase the fact that in most cases, 
LTL lengthening is not a biologically determined result; it 
is a mathematical artifact related to the measurement 
error. 

Finally, there is a need for impartial large-scale evalu- 
ations of the suitability of current TL measurement 
methods — SB, qPCR, fluorescence in situ hybridization 
and single telomere length analysis (STELA) — for 
clinical/epidemiological research. These evaluations, we 
suggest, can only take place if they are sponsored by 
national funding agencies, e.g. the National Institutes of 
Health in the USA and the Medical Research Council in 
the UK, through partnership with investigators in the 
telomere field. Such agencies are investing millions of 
dollars annually in telomere research, the outcome of a 
large portion of which might be dubious because of 
faulty methodology and experimental design. Without 
the oversight of such agencies, translation of the great 
strides made in basic telomere biology into useful insight 
into aging-related disease, including cardiovascular 
disease and cancer, may be compromised in the foresee- 
able future. 
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